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ABSTRACT 

We  sequenced  the  complete  genome  of  Bacillus 
cereus  ATCC  10987,  a  non-lethal  dairy  isolate  in  the 
same  genetic  subgroup  as  Bacillus  anthracis. 
Comparison  of  the  chromosomes  demonstrated 
that  B. cereus  ATCC  10987  was  more  similar  to 
B. anthracis  Ames  than  B. cereus  ATCC  14579,  while 
containing  a  number  of  unique  metabolic  capabili¬ 
ties  such  as  urease  and  xylose  utilization  and  lack¬ 
ing  the  ability  to  utilize  nitrate  and  nitrite. 
Additionally,  genetic  mechanisms  for  variation  of 
capsule  carbohydrate  and  flagella  surface  struc¬ 
tures  were  identified.  Bacillus  cereus  ATCC  10987 
contains  a  single  large  plasmid  (pBc10987),  of 
-208  kb,  that  is  similar  in  gene  content  and  organ¬ 
ization  to  B.anthracis  pXOI  but  is  lacking  the  patho¬ 
genicity-associated  island  containing  the  anthrax 
lethal  and  edema  toxin  complex  genes.  The  chromo¬ 
somal  similarity  of  B. cereus  ATCC  10987  to 
B.anthracis  Ames,  as  well  as  the  fact  that  it  contains 
a  large  pXOI -like  plasmid,  may  make  it  a  possible 
model  for  studying  B.anthracis  plasmid  biology  and 
regulatory  cross-talk. 

INTRODUCTION 

Bacillus  cereus.  Bacillus  thuringiensis  and  Bacillus  anthracis 
all  belong  to  the  B.  cereus  sensu  lato  group  of  rod-shaped, 
Gram-positive,  spore-forming  bacteria  (1).  Bacillus  anthracis 
is  the  etiological  agent  of  anthrax,  an  acute  fatal  animal  and 
human  disease  that  was  employed  as  a  bioterror  agent  in  the 


autumn  of  2001  (2).  Bacillus  anthracis  shares  a  very  close 
evolutionary  relationship  with  two  other  common  but  much 
less  pathogenic  bacterial  species:  B. thuringiensis,  a  well 
known  biological  insecticide  (3),  and  B. cereus,  often  con¬ 
sidered  at  most,  a  soil-dwelling  opportunistic  pathogen  (1). 
There  are  rare,  usually  non-fatal  diseases  associated 
with  B. cereus,  such  as  endophthalmitis  after  trauma  to  the 
eye  (4,5)  and  two  forms  of  human  food  poisoning, 
characterized  by  either  diarrhea  and  abdominal  distress  or 
nausea  and  vomiting  (1,6).  However,  more  serious  infections 
in  immunocompromised  individuals  have  been  observed 
(7-10),  and  some  B. cereus  isolates  have  been  implicated  in 
a  lethal  infection  similar  in  clinical  presentation  to 
B.anthracis,  posing  a  potential  public  health  issue  (11). 
Many  of  the  species-specific  phenotypes  of  this  group  are 
encoded  by  plasmid  genes,  such  as  the  B.anthracis  lethal  toxin 
complex  and  poly-D-glutamic  acid  capsule  (plasmids  pXOI 
and  pX02,  respectively)  (12,13),  and  the  B. thuringiensis 
insect  toxins  (14). 

Bacillus  cereus,  B. thuringiensis  and  B.anthracis  are 
genetically  similar  to  an  extent  that  comparisons  of  16S 
rRNA  sequences  (15)  or  16S-23S  rRNA  spacer  regions  (16) 
cannot  adequately  distinguish  between  the  members  of  this 
group.  There  is  no  consensus  on  whether  these  bacteria  should 
be  separate  species  or  considered  specialized  variants  of  a 
single  species  (17).  Additionally,  B.anthracis  has  been  shown 
to  be  one  of  the  most  monomorphic  bacterial  species  (17,18). 
A  number  of  molecular  typing  schemes  have  been  applied  to 
distinguish  individuals  within  the  group,  including  pulsed-field 
gel  electrophoresis  (PFGE)  (19),  amplified  fragment  length 
polymorphism  (AFLP)  (20,21),  multi-locus  enzyme  electro¬ 
phoresis  (MLEE)  (17,19,22,23)  and  multi-locus  sequence 
typing  (MLST)  (18).  From  this  body  of  work,  it  is  apparent  that 
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a  group  of  B.cereus  and  B.thuringiensis  isolates  are  more 
closely  related  to  B.anthracis  than  strains  represented  by  the 
B.cereus  species  type  strain  ATCC  14579  that  was  sequenced 
recently  (24)  (Fig.  1).  Bacillus  cereus  ATCC  10987  was 
chosen  for  sequencing  as  it  was  a  widely  available  strain,  has 
been  shown  by  MLEE  and  other  studies  to  be  closely  related  to 
B.anthracis  (17,22,25)  and  contained  genes  similar  to  those 
found  on  pXOl  (26).  These  features  made  the  strain  a  useful 
addition  to  the  comparative  genomic  analysis  of  B.anthracis. 

Bacillus  cereus  ATCC  10987  was  isolated  from  a  study  on 
cheese  spoilage  in  Canada  in  1930  (27,28).  It  has  been 
demonstrated  to  contain  putative  virulence  factors  such  as 
phosphatidylinositol-specific  phospholipase  C  (PI-PLC), 
phosphatidylcholine-preferrring  phospholipase  C  (PC-PLC), 
sphingomyelinase,  non-hemolytic  enterotoxin  and  proteases 
(29,30),  and  to  express  a  high  level  of  phospholipase  C 
(A.-B.Kolst0.  unpublished  data). 

The  present  study  compares  the  chromosomes  of  B.cereus 
ATCC  10987,  B.cereus  ATCC  14579  and  B.anthracis  Ames, 
and  reveals  a  number  of  metabolic  pathways  not  identified 
previously  in  the  B.cereus  group  of  organisms,  such  as  urease 
and  xylose  utilization,  as  well  as  potential  mechanisms  for 
antigenic  variability  of  surface  structures  including  capsule 
and  flagella.  Additionally,  we  identify  a  single  large  plasmid 
in  B.cereus  ATCC  10987  that  is  similar  to  the  B.anthracis 
pXOl  plasmid,  and  encodes  a  number  of  unique  potential 
pathogenicity  and  resistance  factors  as  well  as  conserved 
regulatory  proteins. 

MATERIALS  AND  METHODS 

Sequencing  of  the  B.cereus  ATCC  10987  genome 

The  random  shotgun  method,  and  cloning,  sequencing  and 
assembly  were  as  described  previously  (26).  Large  (10-12  kb) 
and  small  (2. 5-3. 5  kb)  insert  random  sequencing  libraries 
were  sequenced  for  this  genome  project  with  success  rates 
of  84  and  87%  and  average  high-quality  read  lengths  of 
666  and  683  nt,  respectively.  The  completed  genome 
sequence  contained  23  042  and  57  171  reads  from  the  large 
and  small  libraries,  respectively,  achieving  an  average  of 
10.4-fold  sequence  coverage  per  base.  After  assembly, 
gaps  between  contigs  were  closed  by  editing,  walking 
library  clones  and  linking  assemblies  by  PCR.  The  Glimmer 
Gene  Finder  (31)  was  utilized  to  identify  potential  coding 
regions,  and  annotation  was  completed  as  described  previ¬ 
ously  (32).  The  sequences  of  B.cereus  ATCC  10987  genome 
and  plasmid  can  be  accessed  using  the  GenBank  accession  nos 
AE017194  and  AE017195,  respectively.  An  estimate  of  the 
copy  number  of  the  plasmid  was  obtained  by  dividing  the 
coverage  depth  of  the  plasmid  by  the  coverage  depth  of  the 
chromosome. 

BLAST  score  ratio  analysis  (BSRA) 

The  BSRA  is  a  modification  of  the  technique  described  by 
Read  et  al.  (33).  For  each  of  the  predicted  proteins  of  B.cereus 
ATCC  10987,  we  obtained  a  BLASTP  raw  score  (34)  for  the 
alignment  against  itself  (REF_SCORE)  and  the  most  similar 
protein  (QUE_SCORE)  in  each  of  the  genomes  of  B.cereus 
ATCC  14579  (24)  and  B.anthracis  Ames  (26).  These  scores 
were  normalized  by  dividing  the  QUE_SCORE  obtained  for 


B.  thuringiensis 


Figure  1.  Phylogenetic  tree  of  members  of  the  B.cereus  sensu  lato 
group.  The  tree  was  based  on  partial  nucleotide  sequences  of  seven  house¬ 
keeping  genes  totaling  2977  nt  (18)  and  was  built  using  the  neighbor-joining 
method  (77)  applied  to  a  matrix  of  pairwise  differences  among  sequences. 
The  scale  bar  is  an  average  number  of  nucleotide  differences  per  site. 
Strains  labeled  with  a  ‘p’  were  obtained  from  human  patients,  ‘s’  were 
isolated  from  soil,  and  ‘d’  indicates  a  dairy  origin.  All  B.anthracis  strains 
analyzed  {n  =  5)  fall  in  the  same  location  on  the  tree  and  thus  no  strain  has 
been  designated.  Bacillus  cereus  clinical  isolates  were  previously  described 
(18,22). 


each  query  genome  protein  by  the  REF_SCORE.  Proteins  with 
a  normalized  ratio  of  less  than  0.4  were  considered  to  be  non- 
homologous.  The  normalized  BLAST  score  ratio  of  0.4  is 
generally  similar  to  two  proteins  being  -30%  identical  over 
their  entire  length. 

BLAST  score  ratios  were  plotted  as  x,  y  coordinates  as 
shown  in  Figure  3.  Each  protein  in  the  reference  genome 
{B.cereus  ATCC  10987)  was  grouped  according  to  its  scores 
in  each  of  the  query  genomes,  and  colored  accordingly: 
yellow,  unique  to  the  reference;  red,  common  to  all  three; 
cyan,  common  between  B.anthracis  Ames  and  the  reference, 
but  absent  in  B.cereus  ATCC  14579;  blue,  common  between 
B.cereus  ATCC  14579  and  the  reference,  but  absent  in 
B.anthracis  Ames. 

PCR  screening  of  a  set  of  B.cereus  group  organisms 

PCR  was  used  to  screen  a  23  strain  set  of  B.cereus  group 
organisms  (Supplementary  table  S 1  available  at  NAR  Online) 
for  the  presence  or  absence  of  integral  genes  of  urease,  capsule 
or  xylose  pathways  using  the  primers  described  in 
Supplementary  table  S2. 
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Table  1.  General  features  of  the  B.cereus  group  genomes 


Feature 

B.cereus  ATCC  10987 
Chromosome  Plasmid 

pBc 10987 

B.anthracis  Amesa 
Chromosome 

Plasmid 

pXOl 

Plasmid 

pX02 

B.cereus  ATCC  14579b 
Chromosome  Plasmid 

pBClinl5 

Size  (bp) 

5  224  2B3 

208  369 

5  227  293 

181  677 

94  829 

5  426  909 

15  100 

No.  of  genes 

5642 

242 

5508 

217 

113 

5366 

21 

Percentage  coding 

85 

80.9 

84.3 

77.1 

76.2 

84 

84 

Average  gene  length  (nt) 

787 

696 

800 

645 

639 

820 

820 

%G  +  C  content 

35.6 

33.5 

35.4 

32.5 

33 

35.3 

38 

rRNA  operons 

12 

0 

11 

0 

0 

13 

0 

tRNAs 

98 

0 

95 

0 

0 

108 

0 

sRNAs 

3 

0 

3 

0 

0 

3 

0 

Putative  phage 

3 

0 

4 

0 

0 

6 

Yes 

Genes  with  assigned  function 

2802 

94 

2762 

65 

38 

3839 

4 

Conserved  hypothetical  genesc 

1074 

74 

1212 

22 

19 

1481 

4 

Genes  of  unknown  function0 

672 

12 

657 

8 

5 

142 

6 

Hypothetical  genes0 

1094 

62 

877 

122 

51 

? 

11 

aThe  genome  data  for  B.anthracis  are  derived  from  the  complete  genome  sequence  of  B.anthracis  Ames  (26)  and  the  plasmid  sequences  of  B.anthracis  str. 
A2012  (Florida)  (13).  Note  that  the  number  of  genes  is  not  the  same  as  the  number  of  proteins  used  in  calculation  of  Figure  4  due  to  the  presence  of 
frameshifts. 

bBacillus  cereus  ATCC  14579  data  were  obtained  from  Ivanova  et  al.  (24). 

cFor  clarification,  a  hypothetical  protein  has  no  other  homologous  protein  in  any  of  the  public  databases,  genes  of  unknown  function  have  similarity  to  broad 
protein  families  but  no  specific  function  can  be  assigned,  whereas  a  conserved  hypothetical  protein  has  one  or  more  homologs  in  another  species;  however,  no 
function  has  yet  been  attributed  to  any  of  these  proteins  to  date. 


RESULTS 

Whole  genome  analysis  shows  that  B.cereus  ATCC 
10987  is  more  closely  related  to  B.anthracis  Ames  than 
B.cereus  ATCC  14579 

Bacillus  cereus  ATCC  10987,  the  third  complete  genome 
sequence  in  the  B.cereus  group,  has  broad  similarities  to  the 
B.anthracis  Ames  (26)  and  B.cereus  ATCC  14579  (24) 
genome  sequences  (Table  1).  The  B.cereus  ATCC  10987 
chromosome  shares  a  high  degree  of  synteny  (conserved  gene 
order)  with  the  B.anthracis  Ames  and  B.cereus  ATCC  14579 
chromosomes  (Fig.  2A  and  B).  Direct  comparison  of  the 
complete  nucleotide  sequences  using  NUCmer  (35)  reveals 
that  B.anthracis  and  B.cereus  ATCC  10987  are  93.94% 
identical  whereas  B.cereus  ATCC  14579  and  B.cereus  ATCC 
10987  are  90.94%  identical.  Additionally,  the  proteins  of 
B.cereus  ATCC  10987,  when  analyzed  by  BRSA,  are  more 
similar  to  those  of  B.anthracis  Ames  than  those  of  B.cereus 
ATCC  14579  (Fig.  3).  This  close  relationship  between 
B.anthracis  Ames  and  B.cereus  ATCC  10987  is  confirmed 
in  a  phylogenetic  tree  based  on  seven  partially  sequenced 
genes  used  in  MLST  analysis  of  the  B.cereus  group  (18) 
(Fig.  1)  and  is  in  line  with  the  results  of  previous  MLEE 
studies  (17). 

We  identified  B.cereus  ATCC  10987  proteins  without 
significant  homology  (BLAST  score  ratio  less  than  0.4)  in  the 
other  two  B.cereus  group  proteomes  using  BSRA.  These 
proteins  will  be  referred  to  as  ‘novel’  herein.  The  relative 
chromosomal  location  of  the  novel  proteins  is  shown  in 
Figure  4A.  There  are  also  a  significant  number  of  proteins  that 
have  homologs  in  only  two  of  the  three  chromosomes  [i.e.  in 
B.cereus  ATCC  10987  and  B.cereus  ATCC  14579  but  not 
B.anthracis  Ames  (Fig.  4A,  Table  2)],  suggesting  a  history  of 
insertion  and/or  deletion  in  the  evolution  of  the  B.cereus 
group.  In  many  cases,  genes  found  at  a  specific  position  in  one 
genome  are  replaced  with  others  at  the  corresponding  locus  in 
another  (for  examples  see  Figs  5  and  6,  and  Supplementary  fig. 


SI).  Since  we  do  not  want  to  make  untestable  assumptions 
about  the  history  of  these  events,  we  will  use  the  neutral  term 
‘replacement’  to  describe  them  herein.  Table  2  provides  a  list 
of  the  replacements  in  each  of  the  three  genomes.  These  loci 
are  often  associated  with  strain-specific  phenotypes,  and 
several  examples  will  be  discussed  in  the  following  analysis. 

The  208  kb  plasmid  in  B.cereus  ATCC  10987  has 
similarities  to  B.anthracis  pXOl 

The  B.cereus  ATCC  10987  plasmid,  pBc  10987,  is  208  369  nt 
in  length  encoding  242  genes  (Table  1).  pBcl0987  was 
compared  with  other  sequenced  large  plasmids  of  the  B.cereus 
group,  B.anthracis  pXOl  (-182  kb),  pX02  (-95  kb)  and 
B.thuringiensis  subsp.  israeliensis  pBtoxis  (-128  kb)  (36) 
using  BSRA.  pBcl0987  and  pXOl  show  little  similarity  to  the 
pBtoxis  proteome  (Fig.  4B)  and  even  less  to  the  pX02 
proteome  (only  five  pX02  proteins  were  conserved  with  either 
pBc  10987,  pXOl  or  pBtoxis).  Comparison  of  pBc  10987  and 
pXOl  revealed  that  -65%  of  proteins  were  homologous  and 
-50%  were  in  a  syntenic  location  (Fig.  2C)  and  the  relative 
transcriptional  direction  of  many  of  the  pXOl  and  pBc  10987 
genes  has  been  retained,  representing  a  conserved  ‘plasmid 
backbone’.  Comparison  of  the  nucleotide  sequences  of  the 
plasmids  using  NUCmer  (35)  reveals  that  pBcl0987  and 
pXOl  are  -40%  identical,  whereas  pBc  10987  and  pBtoxis  are 
only  -7%  identical.  Based  on  nucleotide  and  protein  simil¬ 
arity,  it  appears  that  pBc  10987  and  pXOl  may  be  members  of 
a  group  of  low-copy  number  plasmids  (pBc  10987  approxim¬ 
ately  one/cell;  pXOl  approximately  three  copies/cell)  that 
may  also  include  another  sequenced  plasmid,  pBtoxis  from 
B.thuringiensis  subsp.  israeliensis,  as  a  distant  relative.  Lack 
of  knowledge  regarding  the  replication  machinery  in  these 
plasmids  precludes  us  from  conclusively  grouping  these 
plasmids  together;  however,  the  replication  mechanism  is 
different  from  those  employed  by  B.anthracis  pX02  (12), 
B.thuringiensis  pAW63  (37)  or  small  B.thuringiensis  plasmids 
(38). 
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The  genetic  basis  for  replication,  maintenance  and  mobil¬ 
ization  of  pXOl  is  unknown  (39,40),  suggesting  a  unique 
mechanism  that  may  be  conserved  in  pBc  10987  based  on 
the  level  of  conservation  of  these  plasmids.  pBc  10987 
BCEA0008-BCEA0073  are  most  similar  in  composition  and 
order  to  pXOl  BXA0064-BXA0120  encoding  conserved 
hypothetical,  membrane-associated  and  conjugative  transfer¬ 
like  proteins  of  other  plasmid  systems,  such  as  the  TraD/G 
family  protein  (BCEA0072).  This  region  also  contains 
proteins  that  are  conserved  to  a  lesser  degree  in  pBtoxis, 
suggesting  that  it  may  be  required  in  the  basic  maintenance  of 
these  plasmids.  The  similarity  between  pBcl0987  and  pXOl 
also  extends  into  a  number  of  replication-related  proteins 
including  a  type  I  DNA  topoisomerase  (BCEA0140, 
BXA0213,  respectively)  which  is  thought  to  aid  in  the 
stability  of  these  plasmids  (41).  Only  pBc  10987  contains  a 
unique  plasmid-encoded  DNA  polymerase  111  (3  subunit, 
involved  in  tight  association  of  the  template  DNA  with  the 
polymerase  complex  (42),  which  may  ensure  that  the  plasmid 
is  replicated  at  an  increased  processivity  and  stability. 

The  B.anthracis  pXOl  plasmid  pathogenicity  region  con¬ 
taining  the  genes  encoding  the  transcriptional  regulator  AtxA, 
lethal  factor,  protective  antigen  and  edema  factor  is  absent 
from  pBcl0987,  but  this  region  has  been  replaced  by  a 
B.cereus  ATCC  10987  island  containing  a  copper-requiring 
tyrosinase,  amino  acid  transport  system,  arsenate  resistance 
gene  cluster  and  regulatory  proteins  (gray  box  in  Fig.  2C). 
pBcl0987  also  includes  two  novel  potential  toxins: 
BCEA0165,  a  MIP  family  channel  protein,  and  BCEA0203, 
a  possible  metalloprotease.  However,  the  pBc  10987  island  is 
not  flanked  by  any  mobile  genetic  elements  that  are  thought  to 
have  been  involved  in  the  integration  of  the  pathogenicity 
island  on  pXOl  (39).  The  species-specific  pathogenicity- 
associated  islands  are  the  most  variable  portions  of  the 
plasmids  as  none  of  the  proteins  in  either  island  are  shared 
between  plasmids  (Figs  2C  and  4B). 

Another  interesting  similarity  between  pXOl  and 
pBc  10987  that  may  affect  the  phenotype  is  the  presence  of  a 
plasmid-borne  abrB  gene.  AbrB  is  a  pleiotropic  transition 
state  regulatory  protein  that  has  been  shown  to  negatively 
regulate  the  expression  of  the  lethal  toxin  genes  in  B.anthracis 

(43) .  Additionally,  it  has  been  demonstrated  that  AbrB  in 
conjunction  with  SpoOA  are  the  major  regulatory  factors  in  the 
developmental  pathways  of  spores  and  biofilms  in  B.subtilis 

(44) .  In  B.cereus  ATCC  10987,  as  in  B.anthracis  Ames,  there 
are  two  divergent  chromosomal  copies  of  abrB  as  well  as  a 
plasmid  copy  of  this  regulator,  whereas  B.cereus  ATCC  14579 


Figure  2.  Plot  of  BSRA  based  on  genomic  location.  Each  triangle  indicates 
a  single  protein  plotted  on  the  5'  end  of  the  coding  region  in  the  reference 
genome  and  the  best  match  in  the  query  organism.  They  are  color  coded  as 
follows:  normalized  score  3*0.95,  red;  normalized  score  3*0.80  and  <0.95, 
orange;  normalized  score  3=0.60  and  <0.80,  yellow;  normalized  score  30.40 
and  <0.60,  green;  normalized  score  30.20  and  <0.40,  cyan;  normalized 
score  30.0  and  <0.20,  blue.  (A)  Comparison  of  B.cereus  ATCC  10987 
(reference)  and  B.anthracis  Ames  (query).  (B)  Comparison  of  B.cereus 
ATCC  10987  (reference)  and  B.cereus  ATCC  14579  (query).  (C)  Com¬ 
parison  of  B.cereus  ATCC  10987  pBc  10987  (reference)  and  B.anthracis 
Ames  pXOl  (query).  The  shaded  region  indicates  the  location  of  the 
pathogenicity-associated  island  in  pXOl  that  shows  no  similarity  to  the 
proteins  of  pBc  10987. 
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BLAST  Score  Ratio  Analysis  of  B.  anthnu  <*  Ames  and  B.  i  emu  ATCC  14579 
compared  to  B.  cerem  ATCC  10987 


oa 


B.  i  ereus  ATCC  14579 

Figure  3.  BLAST  score  ratio  analysis  between  the  three  genomes.  Each  point  represents  a  single  protein.  Proteins  that  cluster  in  the  upper  right  hand  comer 
(red)  are  indicative  of  coding  regions  that  are  similar  between  all  three  organisms.  Proteins  in  the  lower  right  (blue)  are  those  that  are  conserved  among 
B.cereus  ATCC  10987  and  B.cereus  ATCC  14579  but  are  not  shared  with  B.anthracis',  upper  left  (cyan)  are  similar  between  B.cereus  ATCC  10987  and 
B  anthracis\  and  lower  left  are  unique  to  B.cereus  ATCC  10987  (yellow).  Proteins  that  are  between  these  extreme  regions  are  thought  to  be  evolving.  The 
inset  evaluates  the  BLAST  score  ratio  of  the  genes  that  are  highly  similar  between  the  three  B.cereus  group  genomes.  The  slope  of  the  linear  regression  is 
1.0139,  which  indicates  that  B.cereus  ATCC  10987  proteins  are  generally  more  similar  to  B.anthracis  Ames  proteins  than  B.cereus  ATCC  14579. 


contains  only  the  two  chromosomal  copies  (Supplementary 
fig.  S2).  The  two  chromosomal  copies  of  abrB  in  B.cereus 
ATCC  10987  (BCE2077  and  BCE0035)  are  each  >97% 
identical  to  the  chromosomal  abrB  orthologs  in  B.anthracis 
Ames /B.cereus  ATCC  14579  (BA2000/BC1996  and  BA0034/ 
BC0042),  but  the  paralogs  are  divergent  (-67%  nucleotide  and 
-50%  amino  acid  identity;  Supplementary  fig.  S2).  Addition¬ 
ally,  the  plasmid-encoded  copies  of  AbrB  are  most  similar  to 
each  other  (-80%  amino  acid  identity)  and  are  as  similar  to  the 
chromosomal  copies  of  this  protein  as  the  paralogous 
chromosomal  copies  are  to  each  other.  It  has  been  previously 
noted  that  the  B.anthracis  pXOl  copy  of  AbrB  was  truncated 
by  27  amino  acids  in  comparison  with  chromosomal  copies 
(43,45).  This  is  not  the  case  for  the  pBcl0987-encoded  AbrB. 
Even  though  B.cereus  ATCC  10987  lacks  the  lethal  toxin 
genes,  the  conservation  of  the  AbrB  homologs  on  the  large 
plasmids  suggests  a  possible  role  in  expression  of  plasmid- 
encoded  factors.  In  B.anthracis,  the  regulatory  activity  was 
attributed  to  a  chromosomal  encoded  copy  of  AbrB  (43,45); 
however,  the  role  of  pBc  10987  AbrB  is  unclear. 

Bacillus  cereus  ATCC  10987  may  prove  to  be  a  convenient 
non-lethal  ‘model’  organism  for  studying  B.anthracis  plasmid 
biology  issues  such  as  plasmid  replication,  maintenance  and 
transfer  as  well  as  regulatory  cross-talk  between  chromosome 
and  plasmid. 

The  pBcl0987  plasmid  sequence  allows  further 
identification  of  genes  in  flux  between  B.cereus  group 
chromosomes  and  plasmids 

As  B.cereus  group  plasmids  and  phages  are  identified  and 
sequenced,  it  is  becoming  apparent  that  many  genes  located  on 
the  chromosome  are  actually  homologs  of  episomal  deter¬ 


minants.  An  example  of  likely  genetic  exchange  between  the 
plasmid  and  chromosome  discovered  in  B.cereus  ATCC 
10987  are  the  two  identical  copies  of  a  3605  nt  Tn554 
element  encoded  by  BCE3147-BCE3149  and  BCEA0242, 
BCEA0001-2.  The  Tn 554  element  is  composed  of  three 
essential  proteins,  TnsABC,  whose  closest  relative  is  the 
Tn554  from  Staphylococcus  aureus  (46).  There  are  four  other 
potential  coding  regions  that  are  conserved  in  association  with 
the  transposable  element  (BCE3150-53/BCEA0003-6).  Most 
interesting  in  this  group  of  proteins  is  BclA,  which  has  been 
shown  to  be  the  major  spore  surface  antigen  of  B.cereus  and 
B.anthracis  (47-49)  as  well  as  an  exosporium  depth  deter¬ 
minant  in  B.anthracis  (50).  Bacillus  anthracis  Ames  and 
B.cereus  ATCC  14579  each  contain  only  a  single  chromo¬ 
somal  copy  of  BclA,  whereas  B.cereus  ATCC  10987  contains 
both  a  chromosomal  copy  and  a  plasmid  copy.  A  BclA-like 
protein,  Bcol  (Bacillus  collagen-like),  has  been  recently 
identified  on  small  (<12  kb)  B.thuringiensis  plasmids  (38), 
but  no  functional  role  could  be  assigned.  It  is  possible  that 
BclA  proteins  affecting  spore  morphology  and  surface 
properties  may  be  horizontally  transferred  among  B.cereus 
group  bacteria  as  part  of  the  Tn554  transposable  element. 

The  gerX  operon,  located  on  the  B.anthracis  pXOl  plasmid, 
but  on  the  chromosome  of  B.cereus  ATCC  10987,  represents 
another  example  of  genes  moving  between  replicons  in  the 
B.cereus  group.  In  B.anthracis,  this  operon  has  been  demon¬ 
strated  to  be  required  for  virulence  and  germination  in  a  mouse 
anthrax  model  (51).  Comparison  of  the  other  germination 
proteins  [gerH,  gerL,  gerK,  gerP,  gerS  and  gerY  operon  gene 
products;  B.cereus  ATCC  10987  lacks  the  B.  cereus- specific 
gerQ  operon  (52)]  among  the  three  B.cereus  group  bacteria 
shows  a  high  level  of  conservation  (>90%  amino  acid 
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Figure  4.  Venn  diagram  illustrating  the  number  of  putative  proteins  associated  with  each  organism  and  the  number  shared  with  the  intersecting  organism. 
Tick  marks  that  are  on  each  circle  represent  the  location  of  the  unique  proteins  (BLAST  score  ratio  less  than  0.4)  on  the  genome.  (A)  Chromosomal 
comparison:  blue,  B. anthracis  Ames;  green.  B.cereus  ATCC  10987;  red,  B.cereus  ATCC  14579.  (B)  Plasmid  comparison:  blue,  B. anthracis  Ames  pXOl; 
green,  B.cereus  ATCC  10987  pBcl0987;  magenta,  B.thuringiensis  subsp.  israeliensis  pBtoxis. 


identity).  The  gerX  operon  is  not  present  in  B.cereus  ATCC 
14579,  whereas  the  gerX-encoded  proteins  from  B.cereus 
ATCC  10987  and  B. anthracis  Ames  share  a  lesser  degree  of 


identity  than  the  proteins  encoded  by  other  germination 
operons  (-67%  amino  acid  identity).  It  is  unclear  whether  the 
gerX  operons  should  be  considered  as  true  orthologs,  and  only 
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Table  2.  Major  phenotypic  differences  and  characteristics  of  B.cereus  group  genomes 


Genotype 

B.  anthracis  Ames 

B.cereus  ATCC  10987 

B.  cereus  ATCC  14579 

Plasmid 

pXOl  (189  kb) 

pBc 10987  (208  kb) 

Linear  phage- like  (15  kb) 

Tripartite  lethal  toxin 

pX02  (96  kb) 

Present 

Absent 

pBClinl5 

Absent 

rRNA 

11  copies 

12  copies 

13  copies 

Urease  gene  cluster 

Absent 

Present 

Absent 

Xylose  utilization  genes 

Absent 

Present 

Absent 

Polysaccharide  capsule 

Absent 

Present" 

Present" 

Flagellar  genes 

Present15 

Present0 

Present0 

Tagatose  utilization  genes 

Absent 

Present 

Partial11 

Functional  PlcR 

Absente 

Present0 

Present0 

Mobile  genetic  elements 

Phage' 

4 

4  (1  degenerate) 

6  (1  linear  plasmid) 

Tn554 

0 

2f 

0 

Tn  7 

0 

26 

0 

15605 

7 

9 

5 

15231 

3 

5 

1 

Arginine  deiminase  genes 

Absent 

Present 

Present 

B.cereus  repeat  1  (bcrl)k 

12  copies 

72  copies 

52  copies 

Restriction  enzymes 

Absent 

Present11 

Present*1 

Competence  genes 

Present* 

Present1 

Present1 

aThe  capsule  biosynthesis  region  of  ~20  kb  is  partially  similar  in  both  B.cereus  genomes  (Fig.  7). 

bBacillus  anthracis  Ames  contains  a  non-functional  flagellar  operon  with  genes  BA1661,  BA1670,  BA1677  and  BA1709  containing  frameshifts  preventing 
the  assembly  of  a  functional  flagellar  structure.  The  flagellar  locus  also  contains  one  flagellin  subunit  encoded  by  gene  BA1706. 

cEach  of  the  B.cereus  genomes  contains  an  apparent  functional  flagellar  operon.  Bacillus  cereus  ATCC  10987  contains  two  divergent  flagellin  subunits, 
whereas  B.cereus  ATCC  14579  contains  four  flagellin  subunits,  three  of  which  are  closely  related  and  one  that  is  divergent  (Supplementary  fig.  S3). 
dThere  are  similar  tagatose  utilization  genes  in  B.cereus  ATCC  14579;  however,  they  are  not  assembled  into  a  potential  transcriptional  unit  and  not  all  of  the 
genes  in  B.cereus  ATCC  10987  are  represented  in  B.cereus  ATCC  14579. 

eplcR  in  B. anthracis  Ames  contains  a  frameshift  that  results  in  a  truncated  and  non-functional  protein.  The  plcR  gene  in  the  B.cereus  strains  is  full  length  and 
apparently  functional  and  can  act  as  a  regulatory  protein. 

fTn554  is  present  in  identical  copies  on  the  plasmid  pBc  10987  and  the  chromosome.  The  adjacent  regions  are  also  duplicated  and  the  total  repeat  is  8309  bp 
and  contains  a  bclA  gene  involved  in  spore  antigen  and  exosporium  determination. 

ST wo  similar  but  unique  Tn7  elements  are  located  on  the  chromosome. 

hThere  are  four  unique  restriction-modification  systems  in  B.cereus  ATCC  10987  and  three  in  B.cereus  ATCC  14579. 

'Many  of  the  required  competence  genes  in  B.subtilis  have  been  identified  in  all  of  the  B.cereus  group  organisms,  but  some  common  genes  are  absent 
(Supplementary  fig.  S4). 

•>The  phages  are  not  conserved  in  sequence  or  genomic  location  in  the  three  genomes  studied. 

kbcrl  is  an  -160  bp  repeated  DNA  sequence  with  unknown  function  (29)  overwhelmingly  over-represented  in  intergenic  regions  of  the  B.cereus  group 
organisms. 
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Figure  5.  Bacillus  cereus  ATCC  10987  urease  operon  replacement.  The  gene  cluster  contains  the  three  structural  genes  encoding  the  urease  enzyme 
{ureCBA\  BCE3664— BCE3662),  genes  for  four  accessory  proteins  responsible  for  insertion  of  Ni+  into  the  apo-enzyme  ( ureEFGD\  BCE3661-BCE3658)  and 
genes  for  two  putative  nickel  transport  proteins  that  are  responsible  for  transporting  Ni+  into  the  cell  ( nixA  and  urel\  BCE3656-BCE3657).  The  red  genes  are 
the  structural  urease  genes,  blue  are  urease  accessory  proteins,  light  blue  are  nickel  transport  proteins,  black  are  hypothetical,  and  green  are  an  unknown  ABC 
transporter  system.  The  urease  cluster  replaces  the  B. anthracis  Ames/ B.cereus  ATCC  14579  genes  that  encode  hypothetical  proteins,  blasticidin  S  deaminase 
and  an  S  layer  protein. 
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Figure  6.  Bacillus  cereus  ATCC  10987  xylose  replacement.  The  xylose  utilization  operon  (green)  in  B. cereus  ATCC  10987  replaces  the  B.anthracis  Ames/ 
B.cereusATCC  14579  nitrate  (red)  reductase  genes,  molybdopterin  synthesis  genes  (blue)  and  nitrite  reductase  (orange).  Black  are  hypothetical  and  conserved 
hypothetical  proteins. 


experimental  evidence  can  determine  if  they  serve  the  same 
function  in  pathogenesis  and  germination. 

Gene  replacements  play  a  major  role  in  metabolic 
specialization  of  B.cereus  ATCC  10987 

One  of  the  most  significant  metabolic  differences  between 
these  three  genomes  is  the  presence  of  a  nine-gene  urease  gene 
cluster  (BCE3656-BCE3664)  in  B.cereus  ATCC  10987,  the 
first  described  in  this  group  of  bacteria.  The  urease  gene 
cluster,  consisting  of  the  urease  structural  ( u  re  ABC )  and 
accessory  proteins  ( ureDEFG )  as  well  as  two  nickel  trans¬ 
porters  (urd  and  nixA),  is  ordered  and  oriented  in  such  a  way 
that  it  may  be  a  transcriptional  unit.  The  gene  cluster  is  part  of 
a  larger  ~11  kb,  15  gene  replacement  (Fig.  5).  At  the 
corresponding  locus,  B.anthracis  Ames  and  B.cereus  ATCC 
14579  have  an  -4.6  kb  region  containing  six  potential  coding 
regions  of  no  obvious  function  (BA3691-BA3696/BC3630- 
BC3635).  The  presence  of  the  urease  enzyme  may  increase 
fitness  of  B.cereus  ATCC  10987  in  acidic  conditions,  much 
the  same  way  that  Helicobacter  pylori  urease  is  required  for 
colonization  of  the  human  stomach  (53). 

Although  no  regulatory  gene  has  been  identified,  we  have 
demonstrated  that  functional  urease  is  produced  by  B.cereus 
ATCC  10987  by  growth  and  color  change  of  Christensen’s 
urea  agar  (54).  Additionally,  using  PCR  to  screen  23  B.cereus 
group  organisms  (Supplementary  table  SI),  we  have  identified 
the  urease  genes  in  one  other  B.cereus  isolate  (n  =10)  and  five 
B.thuringiensis  isolates  (n  =  12),  but  not  in  a  B.mycoides 
isolate  (n  =  1)  (Supplementary  table  S2). 

Another  replacement  unique  to  B.cereus  ATCC  10987 
encodes  the  proteins  responsible  for  the  utilization  of  xylose 
(27).  The  five-gene  xylose  operon  (BCE2208-BCE2214)  is 
located  on  a  unique  10.1  kb  region  that  occupies  the  same 
relative  genomic  location  as  an  -26.6  kb  region  in  B.anthracis 
Ames  and  B.cereus  ATCC  14579  (24,26)  (Fig.  6).  The 
B.anthracis  Ames /B.cereus  ATCC  14579  region  encodes  four 
proteins  that  are  required  for  nitrate  reductase  [a,  (3,  8  and  y 
subunits  (BA2125-8/BC21 18-BC2121)],  a  nitrate  transporter 
(BA2139/BC2128),  a  group  of  proteins  for  the  synthesis  of 
molybdopterin  (BA2133-7/BC2123-7),  utilized  in  nitrogen 


metabolism,  as  well  as  an  [NAD(P)H]  -requiring  nitrite 
reductase  (BA2146-7/BC2136-7).  All  of  the  genes  contained 
in  the  B.anthracis  Ames /B.cereus  ATCC  14579  region  are 
absent  in  B.cereus  ATCC  10987.  The  utilization  of  xylose 
appears  rare  as  PCR  primers  specific  for  the  xylose  permease 
(BCE2208)  failed  to  amplify  the  desired  product  in  any  of 
the  other  B.cereus  group  bacteria  tested  (Supplementary  table 
S2). 

In  B.subtilis,  the  ability  to  reduce  nitrate  and  nitrite  plays  a 
significant  role  in  the  energy  production  under  anaerobic  or 
oxygen-limiting  conditions,  which  utilizes  nitrate  as  a  ter¬ 
minal  electron  acceptor  during  anaerobic  respiration  (55). 
Bacillus  subtilis  mutants  that  lacked  a  functional  nitrate 
reductase  or  molybdopterin  genes  did  not  survive  under 
anaerobic  conditions,  but  were  able  to  survive  in  fermentation 
mixtures  containing  limited  oxygen  (56).  One  other  identified 
nitrite  reductase,  of  a  different  enzyme  class,  is  present  in 
B.cereus  ATCC  10987  (BCE1547),  but  its  role  in  respiration  is 
not  clearly  delineated.  As  for  nitrogen  assimilation,  B.cereus 
ATCC  10987  may  use  ammonia,  a  breakdown  product  of  urea 
hydrolysis.  The  acquisition  of  the  urease  operon  may  be  an 
adaptation  to  the  loss  of  nitrate  and  nitrite  reduction. 
Alternatively,  the  urease  activity  may  allow  the  organism  to 
survive  without  the  ability  to  reduce  nitrate  or  nitrite  and 
hence  can  lose  the  genes  without  decreasing  fitness. 

Bacillus  cereus  ATCC  10987  also  contains  a  17.9  kb 
replacement  responsible  for  the  transport  and  utilization  of  the 
carbohydrate  tagatose  (BCE1896-BCE1912).  The  corres¬ 
ponding  5.0  kb  region  in  B.anthracis  Ames /B.cereus  ATCC 
14579  contains  genes  encoding  hypothetical  proteins  of 
no  described  function.  Bacillus  cereus  ATCC  10987  was 
isolated  from  a  study  on  cheese  spoilage  (27,28)  where  this 
carbohydrate  has  been  found,  hence  the  tagatose  gene  cluster 
may  be  an  adaptation  to  this  carbohydrate-containing 
environment. 

Comparison  of  the  capsule  and  flagellar  variable  surface 
antigens  in  B.cereus  group  bacteria 

It  has  been  noted  that  the  B.cereus  serotyping  based  on 
variable  surface  antigens,  flagellum  and  surface  polysacchar- 
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Figure  7.  Comparison  of  the  putative  B. cereus  capsule  biosynthesis  region.  The  colored  areas  between  the  two  regions  indicate  that  these  regions  share  a  high 
level  of  identity  (>10-5  BLAST,  >60%  amino  acid  identity).  A  conserved  region  consists  of  six  contiguous  genes,  including  a  chain  length-determining  protein 
and  polysaccharide  capsule  assembly  proteins  that  are  similar  to  putative  polysaccharide  capsule  biosynthesis  operons  in  B.subtilis  (75),  Oceanobacillus 
iheyensis  (78)  and  Lactococcus  lactis  subsp.  cremoris  (79).  However,  the  central  portion  of  the  polysaccharide  capsule  locus  consists  of  novel  genes  that 
encode  glycosyltransferases,  a  polysaccharide  polymerase  (BCE5389)  and  a  putative  translocase  (BCE5386).  Green  arrows  are  carbohydrate  utilization  genes; 
blue,  polysaccharide  polymerase;  dark  red,  chain  length-determining  protein;  red,  translocase  (membrane  protein);  gray,  conserved  hypothetical;  and  black, 
hypothetical. 


ides  generally  does  not  agree  with  genotyping  schemes  that 
are  based  on  conserved  chromosomal  markers  (23,57-59). 
Comparative  analysis  of  the  three  genomes  suggests  that 
surface  antigen  variability  may  be  generated  via  gene 
replacements,  duplications  and  deletions. 

Ivanova  et  al.  (24)  described  an  ~20  kb  region  of  B.  cereus 
ATCC  14579  encoding  proteins  (BC5263-5279)  thought  to  be 
involved  in  polysaccharide  capsule  synthesis  that  replaces  a 
B.anthracis  region  encoding  proteins  similar  to  teichoic  acid 
synthesis  proteins  (BA5505-20).  Bacillus  cereus  ATCC 
10987  also  contains  a  polysaccharide  capsule  gene  cluster  at 
this  locus  spanning  ~20  kb  and  encoding  21  putative  proteins 
(BCE5380-BCE5400).  Approximately"  half  of  the  20  kb 
replacement  in  B. cereus  ATCC  10987  contains  genes  similar 
to  the  B. cereus  ATCC  14579  capsule  locus  (Fig.  7). 
Conserved  regions  highlighted  in  Figure  7  may  have  allowed 
for  homologous  recombination,  resulting  in  the  variation 
observed  in  this  replacement. 

None  of  the  23  B.  cereus  group  strain  set  produced 
amplicons  by  PCR  using  primers  designed  to  the  B.  cereus 
ATCC  10987  polysaccharide  polymerase  (BCE5389)  or 
translocase  (BCE5386)  (Supplementary  table  S2).  Similarly, 
the  B.  cereus  ATCC  14579  polysaccharide  polymerase 
(BC5268)  was  identified  in  only  one  of  the  23  strains  tested, 
whereas  BC1588,  an  additional  putative  polysaccharide 
polymerase  in  this  strain,  was  present  in  14  strains 
(Supplementary  table  S2).  These  novel  genes  in  the  B. cereus 
ATCC  10987  locus  may  be  responsible  for  a  specialized 
structure  of  capsule  polysaccharide.  It  is  also  possible  that 
these  genes  may  influence  flagellar  structure:  homologs  of 
UDP-galactose  phosphate  transferase  (BCE5393)  and  an 
aminotransferase  family  protein  (BCE5394)  have  been 
shown  in  Campylobacter  jejuni  to  be  involved  in  the 
glycosylation  of  flagella  (60). 


We  can  find  no  record  of  any  description  of  a  complex 
extracellular  polysaccharide  capsule  produced  by  B.  cereus 
(1),  yet  the  formation  of  complex  biofilms  is  usually 
associated  with  the  expression  of  some  type  of  carbohydrate 
moiety,  and  B. cereus  biofilms  are  a  significant  problem  in 
the  dairy  industry  (61).  The  presence  of  polysaccharide 
capsule  gene  clusters  in  both  B. cereus  isolates  sequenced 
provides  evidence  that  these  structures  may  be  important  in 
environments  faced  by  the  B.  cereus  group  bacteria. 

The  flagellar  antigens  of  the  B.  cereus  group  are  another 
highly  variable  surface  structure,  with  up  to  82  groups  being 
described  using  serological  methods  (62-64).  Bacillus 
anthracis  genome  analysis  revealed  that  four  essential 
proteins  in  the  flagellar  gene  cluster  contained  point  mutations 
and  subsequent  frameshifts  (26)  rendering  the  flagellum  non¬ 
functional.  Lack  of  motility  is  often  cited  as  a  distinguishing 
factor  between  B.anthracis  and  other  B. cereus  group  members 
(65).  In  contrast,  both  B. cereus  genomes  sequenced  contained 
genes  encoding  full-length  proteins  (CheA  BCE  1749/ 
BC1628,  CheV  BCE1777/BC1654,  M-ring  protein  BCE1766/ 
BC1644  and  FliM  BCE1783/BC1662). 

Another  significant  difference  between  the  flagellar  gene 
cluster  of  B. cereus  and  B.anthracis  is  the  number  of  flagellin 
subunits  present.  The  B.anthracis  genome  contains  only  one 
flagellin  gene  (BA1706),  whereas  B. cereus  ATCC  14579 
contains  four  (BC1656-1659)  and  B. cereus  ATCC  10987  has 
two  (BCE1779  and  BCE1780).  Interestingly,  the  flagellin 
genes  are  transcribed  opposite  to  the  orientation  of  the  rest  of 
the  genes  in  the  flagellar  biosynthetic  cluster,  and  in  B.  cereus 
the  multiple  copies  are  clustered  together.  Amino  acid 
sequence  identities  of  the  flagellin  proteins  of  the  B.  cereus 
group  organisms  separate  these  proteins  into  two  groups;  one 
exclusive  to  B.  cereus  and  one  that  appears  common  to  all  three 
organisms  (Supplementary  fig.  S3).  It  is  possible  that  the 
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different  B.cereus  ATCC  10987  flagellins  are  expressed  under 
different  conditions  resulting  in  structurally,  functionally  and 
antigenically  variable  flagella. 

Conservation  of  the  PlcR  regulon  in  B.cereus  group 
genomes 

PlcR  is  a  pleiotropic  transcriptional  regulator  that  recognizes 
the  palindromic  sequence,  TATGNAN4TNCATA,  and  has 
been  implicated  in  the  control  of  a  number  of  virulence  factors 
in  B.cereus  and  B.thuringiensis  (66-68).  Slamti  and  Lereclus 
(69)  demonstrated  that  PlcR  activity  is  regulated  by  the 
presence  of  a  secreted  and  reimported  pentapeptide  produced 
from  the  processing  of  the  PapR  protein  C-terminus.  The  papR 
gene  itself  is  positively  regulated  by  PlcR,  forming  a  quorum 
sensing-like  system.  The  48  amino  acid  PapR  proteins  of 
B.cereus  ATCC  10987  and  B.anthracis  Ames  are  identical  and 
would  produce  the  same  regulatory  pentapeptide  (VPFEY), 
whereas  the  B.cereus  ATCC  14579  PapR  has  four  amino  acid 
changes,  one  of  which  is  present  in  the  secreted  pentapeptide 
(LPFEY). 

There  are  52  putative  PlcR-binding  motifs  in  the 
B.anthracis  genome  (24),  56  in  B.cereus  ATCC  14579  (26) 
and  57  in  B.cereus  ATCC  10987  which  potentially  regulate 
over  100  genes  in  each  isolate.  Comparative  analysis  reveals 
that  there  is  a  conserved  core  of  putative  PlcR-regulated 
proteins  present  in  all  three  genomes.  However,  a  number  of 
potentially  PlcR-regulated  proteins  are  present  only  in  both 
B.cereus  strains,  including  cytotoxin  K  (BCE1209),  non¬ 
hemolytic  enterotoxin  C  subunit  (BCE1970),  the  methyl- 
accepting  chemotaxis  protein  (BCE0638)  and  ribonucleotide- 
diphosphate  reductase,  whereas  an  aromatic  compound 
degradation  pathway  is  present  only  in  B.cereus  ATCC 
10987  (BCE2151-BCE2161). 

DISCUSSION 

Based  on  synteny  (Fig.  2),  overall  protein  and  nucleotide 
similarity  (Fig.  3),  phylogeny  (Fig.  1 )  and  shared  novel  genes 
(Fig.  4),  B.cereus  ATCC  10987  is  more  closely  related  to 
B.anthracis  Ames  (26)  than  it  is  to  another  dairy-isolated 
strain,  B.cereus  ATCC  14579  (27,28).  Although  this  may 
seem  initially  a  surprising  finding,  it  confirms  recent  MLEE 
and  other  studies  that  point  to  the  phylogenetic  intermingling 
of  species  in  the  B.cereus  group  (16-18,70).  B.cereus  ATCC 
10987  contains  a  number  of  characterized  virulence  factors 
such  as  the  non-hemolytic  enterotoxin  complex,  phospho¬ 
lipase  C,  sphingomyelinase  and  cytotoxin  K,  and  thus  has 
pathogenic  potential.  Additionally,  the  large  (-208  kb) 
plasmid  pBc  10987  shares  a  conserved  backbone  with 
B.anthracis  pXOl  (Fig.  2C),  which  may  contain  as  yet 
unidentified  conserved  plasmid  replication  and  maintenance 
functions.  pBc  10987  also  has  some  other  intriguing  parallels 
with  pXOl,  such  as  the  presence  of  a  transition  state  regulator 
homolog,  AbrB,  and  the  spore  coat  determinate,  BclA,  both  of 
which  have  been  demonstrated  to  play  a  role  in  pathogenesis. 

Although  B.cereus  ATCC  10987  overall  has  much  genetic 
similarity  to  the  other  two  B.cereus  group  genomes,  there  are 
clear  differences  in  gene  content  that  point  to  metabolic 
specializations  (e.g.  xylose  utilization  and  urease  genes;  Figs  5 
and  6)  and  surface  structure  variation  (capsule  and  flagellum 
genes;  Fig.  7  and  Supplementary  fig.  S3).  Some  gene 


movements  appear  to  have  been  mediated  by  insertion  of 
genes  via  phages  or  insertion  elements  [i.e.  Tn.554  element  or 
novel  phage  (Table  2)],  previously  observed  in  bacteria  where 
several  closely  related  genomes  have  been  sequenced  (71-74). 
However,  many  replacements  in  B.cereus  ATCC  10987  do  not 
appear  to  be  associated  with  mobile  genetic  elements, 
suggesting  that  either  the  insertion  has  taken  place  through 
homologous  recombination  of  flanking  DNA  or  the  mobile 
elements  are  no  longer  identifiable  due  to  sequence  divergence 
or  deletion.  A  recent  MLST  study  on  77  B.cereus  group 
organisms  demonstrated  that  recombination  in  seven  house¬ 
keeping  genes  was  occurring  at  a  low  level  in  the  B.cereus 
group  (18);  however,  the  frequency  of  horizontal  transfer 
among  genes  required  for  adaptation  to  new  environments 
may  well  be  much  higher. 

This  concept  raises  a  number  of  intriguing  questions  that 
should  be  subjected  to  further  analysis.  How  does  DNA  enter 
the  cell  in  a  natural  situation?  Why  are  multiple  gene  clusters 
found  at  similar  loci  (e.g.  xylose  genes  in  B.cereus  ATCC 
10987  and  nitrate  reductase  genes  in  B.anthracis  and  B.  cereus 
ATCC  14579):  are  these  hotspots  for  recombination?  What  are 
the  roles  of  the  restriction-modification  systems  and  other 
potential  barriers  to  the  flow  of  genetic  information?  Each  of 
the  B.cereus  group  organisms  contains  a  number  of  unique 
restriction-modification  systems  (Table  2).  These  bacteria  can 
be  genetically  manipulated  by  electroporation,  transconjuga¬ 
tion  and  other  methods  with  much  effort,  but  may  have  a 
natural  mechanism  for  acquiring  DNA. 

A  sample  of  three  genomes  of  the  B.cereus  group,  common 
across  the  globe  and  adapted  to  numerous  specific  environ¬ 
ments,  is  not  sufficient  to  begin  to  understand  the  dynamics  of 
genome  evolution  or  to  even  make  any  generalized  statements 
with  great  conviction.  For  instance,  it  is  interesting  to  note  that 
when  the  B.anthracis  Ames  genome  was  completed,  some 
traits  were  labeled  as  B.anthracis- specific  but  now  with  the 
two  B.cereus  genome  sequences  need  to  be  considered  as 
B.cereus  group-specific.  An  example  of  this  is  genetic 
competence:  all  three  B.cereus  group  organisms  lack  similar 
genes  that  have  been  shown  in  B.subtilis  to  be  required  for  full 
genetic  competence  (75,76)  (Supplemetary  fig.  S4).  Yet  are 
we  really  sure  that  these  homologs  are  not  present  in  some, 
unsequenced,  B.cereus  group  strains  or  present  with  low-level 
homology  and  currently  labeled  as  hypothetical?  These 
questions  can  be  addressed  with  techniques  such  as  suppres¬ 
sive  subtractive  hybridization,  plasmid  and  phage  sequencing 
(which  often  contain  novel,  niche-specific  genes)  and  com¬ 
parative  genomic  hybridizations  using  microarrays. 
Inevitably,  however,  whole-genome  sequencing  of  key  strains 
in  phylogenetically  relevant  subgroups  of  B.cereus  sensu  lato , 
such  as  pathogenic  B.cereus  from  periodontal,  neonatal  or 
immunocompromised  patient  sources,  is  going  to  be  the 
workhorse  of  discovery  in  the  near  future. 
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